Performance and limitations of the linguistically motivated Cocoa/Peaberry system in a broad biological domain
Abstract
We tested a linguistically motivated rule-based system in the Cancer Genetics task of the BioNLP13 shared task challenge. The performance of the system was moderate, ranging from 52% against the development set to 45% against the test set. Interestingly, performance did not change appreciably when the system used only entities tagged by its built-in tagger rather than the gold-tagged entities. The lack of an event anaphora module and problems in reducing events generated by a large trigger class to the task-specific event subset were likely major contributors to the moderate performance.
Similar resources
RelAgent: Entity Detection and Normalization for Diseases in Clinical Records: a Linguistically Driven Approach
We refined the performance of Cocoa/Peaberry, a linguistically motivated system, on extracting disease entities from clinical notes in the training and development sets for Task 7. Entities were identified in noun chunks by use of dictionaries, and events (‘The left atrium is dilated’) through our own parser and predicate-argument structures. We also developed a module to map the extracted enti...
Optimal Design of Arch Dams for Frequency Limitations Using Charged System Search and Particle Swarm Optimization
In recent years, the importance of economic considerations in the field of dam engineering has motivated many researchers to propose new methods for minimizing the cost of dams, and of arch dams in particular. This paper presents a method for shape optimization of double curvature arch dams corresponding to minimum construction cost while satisfying different constraints such as natural frequenc...
Cocoa: Extending a Rule-based System to Tag Disease Attributes in Clinical Records
We extended Cocoa/Peaberry, our (RelAgent) existing rule-based entity and event tagger, to tag attributes associated with diseases in clinical records. The Boolean attributes of Negation, Uncertainty and Conditional were handled by an extension of the NegEx algorithm. The multi-valued Course and Severity attributes were detected either within the extended disease spans as output by the system, ...
Using linguistically-defined specific details to detect deception across domains
Current automatic deception detection approaches tend to rely on cues that are based either on specific lexical items or on linguistically abstract features that are not necessarily motivated by the psychology of deception. Notably, while approaches relying on such features can do well when the content domain is similar for training and testing, they suffer when content changes occur. We invest...
Towards a Task-Based Assessment of Professional Competencies
Performance assessment is increasingly considered a key concept in teacher education programs worldwide. Accordingly, in Iran, a national assessment system was proposed by Farhangian University to assess the professional competencies of its ELT graduates. The concerns regarding the validity and authenticity of traditional measures of teachers' competencies have motivated us to devise a localized...
Publication date: 2013